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We investigate the state size of DFAs accepting the shuffle of two words. We provide words u and v, 
such that the minimal DFA for tiuj w requires an exponential number of states. We also show some 
conditions for the words u and v which ensure a quadratic upper bound on the state size of u tu v. 
Moreover, switching only two letters within one of u or v is enough to trigger the change from 
quadratic to exponential. 



1 Introduction 

Since its introduction, the shuffle operation has been aggressively studied as a model of nondeterministic 
interleaving in both purely theoretical and practical contexts. Perhaps due to the intrinsic nondetermin- 
ism of the operation, many problems concerning shuffle remain unsolved; e.g., shuffle decomposition 
for regular languages (though it is decidable [|3] for commutative regular languages or locally testable 
languages while for context-free languages it is undecidable |[3l). 

We follow here the recent trend of attacking the special case of the shuffle of two words, inspired 
by attempts to solve the decomposition problem. It has been shown in fT| that shuffle decomposition 
on individual words is unique as long as there are two letters used within the words. In 0, the result 
from U was extended to show that if two words u and v both contain at least two letters, then the shuffle 
decomposition is the unique decomposition over arbitrary sets and not just words. 

In this paper we ask a different type of question: what is the minimal state size for a DFA accepting 
the shuffle of two given words? For the more general case of languages, it has been shown in that 
the shuffle of two DFAs can yield an exponential minimal DFA (£2(2 nm ), where n,m were the sizes of 
the two DFAs). We show here that DFAs accepting the shuffle of two words also require an exponential 
number of states in general; however, for words obeying certain conditions, a DFA may be constructed 
with, at most, quadratically many states. 

A striking reminder of the complexity of the shuffle is operation is illustrated by showing that two 
words which may be accepted by a quadratically-bounded shuffle DFA can only be accepted by an 
exponentially large DFA when only two letters in one word are exchanged. 
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2 Preliminaries 

Let N be the set of non-negative integers. An alphabet £ is a finite, non-empty set of letters. The set of 
all words over £ is denoted by £*, and this set contains the empty word, A. The set of all non-empty 
words over £ is denoted by £ + . 

Let £ be an alphabet and let u, v G £* . If u = a" 1 2 • • • a"" with ai, . . . a n G £, ai, . . . , a n G N and 
<2j / <H+\, for 1 < i < n, then the skeleton of u is defined as x( u ) =a\a%-- -a n . The different occurences 
of the same letter a in the skeleton of u are called the a-sections of u. Furthermore, for a G £, \u\ a 
denotes the number of a's in u. A word u over £ is called non-repeating if \u\ a < 1 for all a G £. Let 
u, v G £*. The shuffle of u and n is defined as 

u lu v = {u\V\ ■ ■ ■ u n v n | u = U\ ■■■u n ,v = v\ ■■ -v n ,Ui G £*,Uj G £*, 1 < i < n}. 

We say u is a SMjfJfo of t>, written u < s i>, if v = xu, for some x G £*. 

A trajectory for two words it and f is a word i G {0, 1}*, such that \t\ = \u\ and \t\i = \v\. Then the 
shuffle of u and v on t is denoted by um t v and is the unique string in u lu v, where a letter from u is 
used whenever t has a at the respective position, and a letter from v is used whenever t has a 1. For 
details regarding shuffle on trajectories, consult O. 

We assume the reader to be familiar with nondeterministic and determinisitic finite automata. See 
J2J HI for an introduction and more details on finite automata. For each NFA we can effectively construct 
an equivalent DFA by using the so-called subset construction 0. For an NFA with n states, the DFA 
constructed this way can have up to 2 n states. There exists a unique minimal DFA (up to isomorphism) 
for each regular language. States p and q of a DFA are distinguishable if there exists x such that 5(p,x) 
is a final state, but 5(q,x) is not, or vice versa. Moreover, if every state of a DFA is accessible and every 
pair of states are distinguishable, then the DFA is minimal Q. For both NFAs and DFAs we use size 
synonynously with state size, and, thus, we define \A\ = \Q\. 

3 Shuffle NFAs for words 

In this section we discuss basic properties of shuffle NFAs for two words. 

Definition 1. Let £ be an alphabet and let u = u\ ■ ■ • u m ,v = v\ ■ ■ • v n G £ + , where n, ,, Vj G £ for all 
1 < i < m and 1 < j < n. We say A is the naive shuffle NFA for u and v if A = (Q,L,5,qo,F) where 
Q = {0,...,m}x{0,...,n}, q = (m,n), F = {(0,0)} and 

• for 1 < k < m, < I < n, we have (k— 1,1) G S((k,l),U( m _f, + i-\); and 

• for < k < m, 1 < I < n, we have (k,l— 1) G 5((A;,/),t;( n _; +1 )). 

For all i and j with 1 < i < m and 1 < j < n we denote by Ui and Uj the suffixes of length i and j 
or the words u and v, respectively. We furthermore define LA{i,j) =U{iu Vj, which is accepted by the 
automaton A' = (Q,L,5,(i,j),F). 

Note that the automaton as defined above is not complete.lt is clear from Definition Q] that the naive 
shuffle NFA for u and v does in fact accept u lu v. 

Definition 2. Let A be the naive shuffle NFA for two words u and v over some alphabet £. The ver- 
tical layers and horizontal layers (shortly, v-layers and h-layers) are numbered 0, 1, . . . , \u\ + \v\ and 
\u\, \u\ — 1, . . . 1,0, — 1, . . . , — \v\, respectively. The vertical layer (horizontal respectively) k, contains all 
states with i + j = k (contains all states (i, j) with k = i — j). 
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The vertical layer tells us how many letters we have read thus far, while the horizontal layer tells 
us the difference between the numbers of letters we have read from u and v. Note that the initial state 
(\u\, \v\) is in horizontal layer \u\ — \v\ if \u\ > \v\, and in horizontal layer |t>| — \u\ if \v\ > \u\. 

Definition 3. Let £ be an alphabet and let A be the naive shuffle NFA for some words u,v E Z + . Let 
a G L and ii,i 2 ,ji,j 2 £ N. Then R = (a, (*i , Ji ) , (12, h)) is a nondeterministic area of A if 

I I > i\ > i 2 > 0, |u| > ji > j2 > 

and 

1. all states with i\ > i > ^2, ji > J > J2 are nondeterministic on a, 

2. if they exist, («i + l,ji) and + 1) are determistic on o, and 
3- <5((z2, j2),o) is undefined. 

The set of all nondeterministic areas of A is denoted by Area(A), and we define the entrance and exit 
states of R and the states in R = (a, (ii,ji), («2 5 J2)) as 

ent(i?) = {(ii,i) I ji >j> j2}U{(i,ji) I i\ > i > i 2 }; 
ex(R) = {(i 2 ,j) I ji >j> 32} U {(i,j 2 ) I h > i > ii}; 
states (i?) = \ i\>i> i 2 ,ji >j> ji}- 

Example 4. Let u = bbaa, v = aab. Then the naive shuffle NFA A for u and v has 

Area(A) = {(a, (2,3), (0, 1)), (b, (4, 1), (2,0))}. 

A is depicted twice in Figure |4j first with the different horizontal and vertical layers labelled and then 
with the nondeterminisitic areas shown in grey. 
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Figure 1 : Naive shuffle NFA for u = bbaa and v = aab o 
We know from Q that given an NFA accepting a finite language over a k letter alphabet with q states, 



a minimal DFA accepting the same language has at most 0{k lo ^- {k '> +l ) states in the worst case. Thus for 
a binary alphabet, 0(2^) states are both necessary and sufficient in the worst case. 

In the case of naive shuffle NFAs, it is immediately obvious that during a subset-construction only 
state labels from the same vertical layer can appear within the same state of the DFA. If \u\ = m and 
\v\ = n with < n < m, then for each number between 1 and n there are two vertical layers with that 
number of states, and there are (m — n + 1) vertical layers with (n+1) states. If we assume that for each 
v-layer, all subsets of states except the empty set are possible (it is sufficient to add the empty set once) 
then this gives us an upper bound of 



2 £ (2* - 1 ) + (m - n + 1) (2 n+1 - 1 ) + 1 = 2 n+ 1 (m - n + 3) 



m — n- 



(1) 



i=i 
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for the number of states in the equivalent DFA. Recall that the NFA has (m + 1) (n + 1 ) states, so the 

(m + l)(re + l) 

bound in (fl} is better than the bound 0(k 1 °s2( fe )+ 1 ) (where k is the size of the alphabet) from Q for 
arbitrary finite languages. 

When u and v are over disjoint alphabets then the naive shuffle NFA for u and v is also the minimal 
DFA for uiu v. This can be seen as every pair of states that are not distinguishable would have to be 
in the same vertical layer, however, every two states in the same layer have some different path to the 
final state. Thus, all pairs of states are distinguishable. So, in the worst case there is a lower bound of 
{\u\ + 1) • (\v | + 1) on the size of the shuffle DFA for u and v. 

We can also see that the bound (GQ) is not tight, as only labels of states of the NFA which have identical 
Parikh vectors can appear together as the label of a state in the DFA. Thus the bound (Q} would be reached 
only if u,v G {a}* for some a G £. But then the minimal DFA for u lu v would only have \u\ + |u| + 1 
states, a contradiction. 

Definition 5. Let u and v be words over some finite alphabet £ and let A be the naive shuffle NFA for u 
and v. A walk through A is a sequence of states sq,s\, . . . , si u \ + \ v \, where so = (\u\, \v\), s\ u \ + \ v \ = (0,0), 
and for all i with < i < \u\ + \v\, we have Si + \ G 5(si,a) for some a G £. We say that a given vertical 
or horizontal layer is visited x-times during a given walk if exactly x states from that layer appear in the 
walk. 

Note that there exists a bijective mapping between the walks through a naive shuffle NFA and the set 
of possible trajectories for the shuffle of u and v. 

Lemma 6. Let u,v be words over some alphabet £ and let A be the naive shuffle NFA for u and v. Then 
during each walk through A, every vertical layer has to be visited exactly once, while each horizontal 
layer may be visited once, multiple times or not at all. However, if\u\ > \v\ then each of the horizontal 
layers 0, 1, . . . , — \ v\ has to be visited at least once, and similarly if\v\ > \u\. 

4 Shuffle DFAs for periodic words 

In this section we focus on a special case of the shuffle of two words, namely the shuffle of two words 
that are periods of a common underlying word. Thus u = w\w k and v = W2W l , where w G £ + , w ^ a + 
for any a G L, k,l > and both w\ and u>2 are suffixes of w. At first glance one could assume that 
these words lead to an exponential blow-up in the state size when converting the naive shuffle NFA to a 
DFA, because they induce long common factors. However we will show that this is not the case when 
the underlying word w contains at most one section per letter in £. We first show two subset-relations 
between different periodic shuffles over the same underlying word. These subset-relations are then used 
to construct the DFA in a more efficient manner. 

Lemma 7. Let £ be a finite alphabet and let w = a\- ■ ■ a n for some n > 2, such that alph(w) > 2. Let 

u = w\w k , v = W2W 1 , v! = w\w k , v 1 = W2W 1 where < I < k' < k, < I < I' < k, k + l = k' + /' and 
w\ , W2 are both either empty or proper suffixes of w. Then u lu v C u' lu v'. 

Proof: Let A be the naive shuffle NFA for u and v. Let t be a trajectory for u and v. We construct a 
trajectory t' for u' and v ', such that u mt v = v! in t ' v'. 

As discussed in Lemma[6l the horizontal layers 0, . . . , \u\ — \v\ have to be visited at least once during 
any walk through A. Let p < \u\ — \ v\ be maximal such that p mod n = 0. Thus p > \u\ — \v\—n, which 
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implies, as |it| — \v\ > n, that layer p has to be visited at least once during any walk through A. Let 
p' = p — n(l' — I) (see Figure©. Then 

P > \u\ — \v\—n — I'n + ln = kn — ln + \w\\ — \w2\ — n — l'n + ln>kn — I'n — 2n> —n. 

Thus p' > — n, but as p' mod n = 0, this implies that p' > and, thus, p' is also visited at least once 
during any walk through A. 

We let be the first occurence of a state in h-layer p in unit v and we let (i',f) be the first 
occurrence of a state in h-layer p' in u \±it v. 

Then i mod n=j mod n and i' mod n = f mod n, which means that when in states and (i',f) 
we are at the same point in the underlying period w for both words u and v. Let t = htitj where t\ is 
the part of t before visiting ti is the part of t after visiting but before visiting (i',f) and £3 is 
the part of t after visiting Then | i 2 1 1 = l^lo + n(l' — 0- Now let t' = t^t-s, where ti is obtained 

from ti by switching all 0's for l's and vice versa. Then v! Lu f / v' = u LUf v. 




Figure 2: Transformation of a trajectory by switching all 0's and l's in ti. 

In order to show that the inclusion is proper, assume without loss of generality that k' > I' and 
I w\ I = I wi \ + 9> Q > 0- We define an infinite word w = af • • • a^af • • • - • • and let z = u' mt z v' where 
t z =0i(0\)\ w 2\+i'n (k'-i')n_ Then z = ZlZlZ ^ where | Zl | =g> | 02 | = \ W2 \ + l' n _ Then z 2 is a factor of w. 

But the length of factors of w that are also factors of a word in u lu d is bounded by \ w2\ +ln< \w2\ +l'n, 
hence z £ u' m v' \ uuj v. □ 

The next result is similar to the previous one, but now only the suffixes of w at the beginning of the 
words are swapped and the number of repetitions of w do not change. The proof is omitted due to space. 

Lemma 8. Let £ be a finite alphabet and let w = a\ ■ ■ ■ a n for some n>2, such that alph(w) > 2. Let 
u = w\w k , v = W2W 1 , v! = W2W k , v 1 = w\w l where < I < k and W2 < s w\ < s w. Then u lu v C u' lu v'. 

We can use Lemma [7] and Lemma [8] to show a subset-relation between the languages defined by 
certain states of the naive shuffle NFA for two words that are periodic over the same underlying word. 
This result will be useful in the next subsection to show that the minimal DFA for the shuffle of periodic 
words over certain underlying words is smaller than the naive NFA for these words. 
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Lemma 9. Let u = vo\W and v = w%w , where w = a\---a n for some n > 1 such that a\,...a n G £ 
and w\ and W2 are suffixes of w. Let A he the naive shuffle NFAfor u and v and let ,f be natural 
numbers such that 

1. 1 < i < \u\, 1 < i! < \u\, 1 < j < \v\, i<f< \v\; 

2. i+j = i'+f; 

3. {i mod n,j mod n} = {i 1 mod n,f mod n}; and 

Then L A (i,j) C L A (i',f), and L A (i,j) = L A (i',f) if and only if{i,j} = {i',f}. 

Proof: Obviously {i,j} = {i' implies L A (i,j) = L A (i' so we only have to show that Condi- 
tions 1, 2, 3 and \i — j\ > \i' — j'\ imply that L A (i,j) C L A (i',j'). 

By Condition 1 there exist suffixes IZj and Ui> of u and suffixes Uj and Vj> of v, such that 

L(i,j) = Ui lu Uj and L(i ,j ) = i_u Uj'. 

Condition 3 implies that there exist suffixes w\,W2 of w, such that = \w\w p , W2W q } and 

{?Ij/,iJj/} = {w\w p ,W2W q } for some p,q,p' ,q' > andp,p' < k and < Furthermore, Condition 2 
implies that the words in L(i,j) and L(i' all have the same length, which implies p + q = p' + q' '. 
Now we get two cases, depending on whether i and j are from the same iteration of w as i' and j' or not. 

If {i div n,j div re} = {i' div n,j' div n}, then {p,q} = {p',q'}- Then |i — j\ > |i' — j'\ implies that 
p = q' and q = p' and either both \wi \ > \w2\ and p > q, or both |ltJi | < |uJ2| and p < q. We assume the 
former without loss of generality and obtain L(i,j) C L(i',j') by Lemma[8] 

If {i div re, j div re} ^ {i' div re, j' div n} then {i, j} / {i, j'} follows immediately. Thus, by Condi- 
tion 4, we have — > \i' — j'\, which implies without loss of generality that q < q' <p and q <p' < p 
(the case where p < q' < q and p < p' < q is symmetric). But this implies that L(i,j) C L(i' by 
Lemma |7J □ 

4.1 Underlying non-repeating words 

We now show that the shuffle of periodic words over a non-repeating w yields deterministic finite au- 
tomata that have at most a quadratic number of states. 

Theorem 10. Let u = w\w k and v = W2W 1 , where k > l>0 and w = a\--- a n for some re > 2 such that 
ai = cij implies i = j whenever 1 < i,j < n and W\, W2 are non-empty suffixes of w. Ifk>l, then the 
minimal DFAfor u lu v has (\u\ + 1) • (\v\ + 1) — ^(1^1) • (\v\ + 1) — ■ (m + 1) states, where m < \v\ 
is maximal such that (|u| — m) mod re = 0. If k = I, \wi \ > \w2\, then the minimal DFAfor umu has 
(\u\ + 1)(H + 1) - • (\v\ + 1) - i?(M - \w\)(\u\ - \w\ + 1) states. 

Proof: Assume first that k > I. We construct the naive shuffle NFA 

A = (Q,Z,6,s ,F) 

for u and v. Obviously \Q\ = (\u\ + 1) • (\v\ + 1) by Definition Q] In the following we perform several 
transformations with the automaton A, so that in the end A has the properties that are mentioned in the 
theorem statement. 
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Removing j\v \ • (\v \ + I) + • (m + I) s t a t es : We look at the horizontal layer 0, which contains the 
final state (0,0) as well as the states (|u|,|u|),...,(l,l). All the states in this layer, except the final state 
are nondeterminisitic, so for all i with < i < \v\ there exists a£l, such that 

5((i,i),o) = 

By Lemma|9] we know that L(i,i — 1) = L(i — Thus, we can modify the transition function 5 to 
S((i,i),a) = — without changing the accepted language. When we have done this for all nondeter- 
ministic states in the horizontal layer 0, the states in the horizontal layers — 1, . . . , — \v\ are unreachable 
and can be removed from Q. The number of states removed in this way is * = \\ v \ ' (M + 1) ■ 
We now look at the horizontal layer \u\—m, which contains the states 

(\u\,m), (\u\ — l,m — 1), . . . , (\u\ — m,0). 

As m < \v\ is maximal, such that — m) mod n = 0, the horizontal layers 

(H-H),(H-(H-i)),...,(H-(m+i)) 

do not contain any nondeterministic states. 

Furthermore, we know that all states in the horizontal layer \u\ — m except for the state (\u\ — m,0) 
are nondeterminisitic. Thus, if we let be one of the nondeterministic states in the horizontal layer 
\u\ — m, then (i, j) = (\u\ —p,m — p) for some < p < m and there exists a£l, such that 

5((i,j),a)={(i-l,j),(i,j-l)}. 

This implies that {(i — 1) mod n,j mod n} = {i mod n, (j — 1) mod n}, as the outgoing transitions of 
both states (i — and — 1) carry the same labels and w is non-repeating. Also it is obvious that 
(i — 1) +j = i + (j — 1) and 1 < i < \u\, 1 < i — 1 < \u\, 1 < j < \v\, 1 < j — 1 < \v\. Furthermore as 
|n| > \v\, we have \(i — 1) —j\ = \\u\ — \v\ — l\ <\\u\ — \v\ + l\ = \i — (j — 1)|. Therefore by Lemma|9] 
we have L{i — l,j) C L(i,j — 1), which implies that we can modify the transition function 5 of A to 
5((i,j),a) = — 1) without changing the accepted language. Once we have done that for all states in 
the horizontal layer |n| — m, the states in horizontal layers \u\ — m + l,...,|it| are no longer reachable 
and can be removed. The number of states removed in this way is JJiLi i = hm- (m+1). 

We now have \Q\ = (\u\ + 1) • (\v\ + 1) — \(\v |) • {\v\ + 1) — \m- (m + 1), as claimed in the Theorem 
statement, however A could still be nondeterministic. 

Removing remaining nondeterminism: The only horizontal layers left in A' are | u \ — m, ... ,0. Fur- 
thermore we have already removed all nondeterminism from the horizontal layers |u| — m and 0. Also 
note that all states £ Q now have i > j and the only states with i = j are those in the horizon- 
tal layer 0. Thus, all remaining nondetermism must occur in the horizontal layers \u\ — m — 1, . . . , 1. 
However, a state is nondeterministic precisely when imodn = j mod n, which is only possi- 
ble for states in the horizontal layers \u\ — m — pn where 1 < p < k — I. Let be such a state. 
As (i,j) has precisely two outgoing transitions, this implies that there exists a letter a £ Z, such that 
5((i,j),a) = {(i — 1, j), — 1)}. As no letter appears more than once in w, we know that 

{(i — 1) mod n,j mod n} = {i mod n, (j — 1) mod n}. 

Thus, as i > j implies that \ 1) — j\ < — which implies, by Lemma|9l 



L(i,j-l)CL(i-l,j). 
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We can, thus, redefine 6((i,j),a) = (i— 1, j). 

Showing minimality: We can show that A is minimal by induction on the layers. The details of this part 
of the proof are omitted due to space. 

If k = I, then there are fewer than \w\ horizontal layers between the initial and final state and the 
proof has to be changed slightly. The proof for this case in omitted due to space. □ 

Example 11. Let u = bc(abc) 2 , v = abc. Then the naive NFA for u lu v is shown on the left side of 
Figure [3] According to the proof of Lemma [lOl we can remove all the shaded states and transitions and 
we can furthermore also remove the dashed non-shaded transitions. This then leaves the minimal DFA 
for u lu v, as shown on the right side of Figure [3] o 
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Figure 3: Naive shuffle NFA and minimal shuffle DFA for u = bc(abc) 2 and v = abc. 

From the proof of Theorem [10] it is immediate that we can construct the minimal shuffle DFA for 
periodic words over a non-repeating underlying word directly without first constructing the NFA. 

4.2 Periodic words with one section per letter 

We now generalize Theorem [10] to underlying words the skeletons of which are non-repeating. That is, 
we still consider only words u = w\w and v = W2W 1 , where k>l>0 and w\ and W2 are proper (possibly 
empty) suffixes of w. However, w no longer has to be non-repeating, but we now have w = a\ l •■• a V n 
for some n > 2 and positive integers p\,...,p n and where a\ ■ ■ ■ a n is non-repeating. 

Theorem 12. Let £ be a finite alphabet and let w £ S + , such that \ w\ =n>2 and for all a £ £, we have 
\xi w )\a f) 1- Let u = w\w k and v = W2W l where w\,W2 are suffixes of w and k,l > 0. Then there exists 
a DFA A with L{A) = u lu v and \A\ G 0(\u\ ■ \v\). 

Proof: Let A' = (Q' ,L,5' ,Q' ,F') be the naive shuffle NFA for u and v. Obviously 



\A'\ = (\u\ + l)(\v\ + l). 
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We show that for each nondeterministic area R G Area(A'), we can determinize R in such a way, by 
using Lemma |9j that no state in the DFA contains more than one label from ex(i?), and no more than 
0(| states (7?) Uex(i?)|) contain labels from states (R) Uex(R). 

Let R = (a, (ii, ji), {12,32)) G Area(^4') and let (i,j) G ent(72). When determinizing 7? by using 
a subset construction it is easy to see that if both states (i' , f) G Q' and (i" ,j") G Q' can be reached 
from (i,j) by reading A; a's for some A: G N, then also all states (i,j) with i +j = i' + f and either 
both i' < i < i" and / > j > j" or both i' > i > i" and j' < j < j" can be reached from (i, j) by 
reading k a's. Furthermore if some state (i',j') can be reached from (i, j) by reading k a's, then also 
some state (i'" ,j"') G ent(7?) Uex(7?) can be reached from (i,j) by reading k a's. This implies that at 
most 2 j states (7?) Uex(7?)| states can result from a subset construction on R, assuming that we are starting 
with states that contain only individual entrance state labels. 

It is also obvious that each state q obtained by performing a subset construction on 7? contains at 
most 2 exit state labels (as there are only two exit states of 7? per vertical layer). If there is at most 
one exit state of 7? in q, then q does not induce any states with multiple labels outside of the states in 
states(Ti) Uex(7?) and we are done. If q contains distinct exit states (i',j r ) and (i",j") then there exists 
an n G N, with 1 < n < {i\ — 12) such that (i',f) = {12 — 1, 32 + n) and (i" ,j") = {12 +n, 32 — 1) (or vice 
versa). But then, as 12 mod n = j% mod n, we know that {i' mod n,f mod n} = {i" mod n, j" mod n}. 
Furthermore we know that i' +f = i" ' + j" and either \i' — j'\ > \i" — j"\ or \i' — j'\ < \i" — Thus by 
Lemma we have either L A (i',f) C L A (i",j") (if \i' -f \ > or L A ,(i" ,j") C L A ,(i',f) (if 

\i" ~ j"\ > \i' an d, hence, we can remove one of (i',f) and (i",j") from q without changing the 
accepted language. 

Thus, the nondeterministic areas do not induce any states with multiple layers outside of the nonde- 
terminisitic areas, which implies that |y4|GO(|u|-|f|). □ 

5 Exponential shuffle automata 

Theorem 13. Let £ be an alphabet of size at least 1. Then there exist words u,v G T + , \u\ = \v\, such 
that the size of the minimal DFA accepting u i_u v, is 

Note that, in the proof below, the numbering of the layers is different from the numbering used thus 

far. 

Proof: For n > 1, let 

u n = (aabb) n aabbaabb(aabb) n aaaaa,v n = (aabb) n aabababb(aabb) n bbbbb, 
X n = a(aabb) n aaa(bbbbaaaa + bbbabaaa) n+l bbbb(aabb) n aaaaabbbbb. 

Let A n = (Q, Z, qo,F, 5) be the naive shuffle NFA for u n and v n . We have A2 pictured in Figure [4] 
Let m = \v n \ = \u n \ = 8n + 13, and there are 2(8n + 13) + 1 = 16n + 27 vertical layers. For each 
layer i, let Qi be the set of states in that layer. Let qij be the jth state (along the diagonal) in the ith 
layer. There are i states in the ith layer for i < 8n + 14 and (8n + 14) — (i — (8n + 14)) = 16n + 28 — i 
for Sn + 14 < i. For each w which is a prefix of some word in u n lu v n , let Q w be the set of states 
5(qo,w). We will only consider input words in X n . In Figured we have the set of states Q w , with each 
state denoted by bullet points. 

We show by induction that for each i, 1 < i < n,Q a ^ aabb y = {q4i+2,j,Q4i+2,j+3 \ j = 2+4/,0 <l<i}. 
This is the "duplication stage", consisting of the states in the shaded top left corner of Figure H] The 
details of this part of the proof are omitted due to space. 




Figure 4: The diagram is the naive NFA A%, with the top left corner as the initial state, the bot- 
tom right corner being the final state, and the lines of the grid being transitions on the letter la- 
belling the axis, with 112 along the horizontal and V2 along the vertical axis. The input to A2 is 

a(aabb) 2 aaa(bbbbaaaa)(bbbabaaa)(bbbbaaaa)bbbb(aabb) 2 aaaaabbbbb, with active states marked with 
bullet points. 



Thus, after reading a(aabb) n , we are in one of the states in 

Qa(aabb) n = { a 4n+2,j, Q4n+2,j+3 \ j = 2 + 41,0 < I < Tl] . 

This occurs at the bottom diagonal of the "duplication" section in Figure @] Then 

Qa(aabb)"aaa = {Q4n+5,j I j = 3 + 41, < / < n} 

which is of size n+ 1. The next set of input letters is in (bbbbaaaa + bbbabaaa) n+l . This is the so called 
"filtering stage", marked in white in Figure [4] Intuitively, each element of Q a (aabb) n aaa> as determined 
by I, will continue roughly along a diagonal (we get a diagonal for / being 0, 1,2 in the figure) until each 
reaches baba along v marked by the "prune" line of the figure. If the input is then bbbbaaaa, this diagonal 
gets "cut off", while all other states in the vertical layer are able to continue along its diagonal. However, 
if the input is bbbabaaa, then every diagonal in the vertical layer is able to continue. Since each diagonal 
reaches the "prune" line at a different time, we can selectively keep or remove each diagonal one at a 
time. 

More formally, assume that x\---x n+ \ is the input, Xi £ (bbbbaaaa + bbbabaaa) . Let 7Tj = if 
Xi = bbbabaaa, and 7Tj = 1 if Xi = bbbbaaaa. The sections of A2 when reading xi,X2,x^ are separated 
by lines in Figure |4] where 7ri = 1,7T2 = 0,7T3 = 1. We can then show by induction that for each i, 

\<l<n+l, Qaiaabb^aaaxv-Xi = {<?4n+5+8i,j | j = 3 + 41 + 4l, < I < n, (I < i 7T; = 1)}. 

The details of this part of the proof are omitted due to space. 
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Hence, Q a (aabb)™aaaxi-x n+l = {<?4n+5+8(n+i) ,j I 3 = 3 +4/ + 4(ra + l),7Tj = 0}. No matter the contents 
of this set, which depends on x\, ■ ■ ■ ,x n+ \, every state can reach a final state on bbbb(aabb) n aaaaabbbbb 
since the rest of u is of the form bb(aabb)* aaaaa and the rest of v is of the form bb(aabb)*bbbbb. There- 
fore, if we use the subset construction (H on A n , there is only one set of states we can be in after read- 
ing each prefix of a(aabb) n aaa. As we read each prefix w of x\ ■■ -x n+ i,w = x\ ■■ -x^y, \y\ < S,Xj S 
(bbbbaaaa + bbbabaaa),j < i, then <7 4 n+5+8i,3+4;+4i G Q a (aabb) n x v -xi if and onl y if 1 > 1 or n = °- There 
are 2* such subsets. And indeed, if \y\ > 4, then #(<74 n +5+8i,3+4i+4i;y) i s undefined if and only if 7Tj + i = 1. 
Hence, after reading each prefix of length 1 to \x\ ■ ■ ■ x n+ \ | , there are 

3 + 8-2 1 +8-2 2 + -- - + 8-2 n + 5- 2 n+1 = 3 + 5 • 2 n+l + 8(2 ! + • • • + 2 n ) 
= 3 + 5 • 2 n+1 + 8(2 n+1 - 2) = 13(2 n+1 ) - 13 = 13(2 n+1 - 1) 
sets of states created in the subset construction. Thus, when reading every prefix of 

a{aabb) n aaax\ ■ ■ ■ x n +\ , 

4(n + 1) + 13(2 n+1 - 1) sets of states are created and thus the subset construction requires at least this 
many states, and the remaining input is of length 4(n + 1) + 10, the automaton from the subset construc- 
tion has at least 8(n+ 1) + 13(2 n+1 - 1) + 10 states. 

We can now show that the minimal automaton created from this subset construction automaton re- 
quires this many states as well, by showing that there are at least this many distinguishable states (51 . The 
details of this part of the proof are omitted due to space. Hence, we get Q(v^ n ) where \u n \ = \v n \ =m. 

□ 

Theorem [13] is especially interesting in light of Theorem [121 which showed that the minimal DFA 
for the shuffle of u = (aabb) 2n+2 and v = (aabb) 2n+2 is in 0(n 2 ). It is easy to see that adding 5 a'a and 
5 6's to the ends of these words does not change this bound. The words used in the proof of Theorem [T3l 
differ from these u and v only by switching two letters in one of the words, and yet this subtle change is 
enough to cause an exponential blow-up in size. 
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